An improved memristor model for brain-inspired computing
Zhou Errui, Fang Liang, Liu Rulin, Tang Zhenseng
State Key Laboratory of High Performance Computing, College of Computer, National University of Defense Technology, Changsha 410073, China

 

† Corresponding author. E-mail: lfang@nudt.edu.cn

Project supported by the National Natural Science Foundation of China (Grant No. 61332003) and High Performance Computing Laboratory, China (Grant No. 201501-02).

Abstract

Memristors, as memristive devices, have received a great deal of interest since being fabricated by HP labs. The forgetting effect that has significant influences on memristors’ performance has to be taken into account when they are employed. It is significant to build a good model that can express the forgetting effect well for application researches due to its promising prospects in brain-inspired computing. Some models are proposed to represent the forgetting effect but do not work well. In this paper, we present a novel window function, which has good performance in a drift model. We analyze the deficiencies of the previous drift diffusion models for the forgetting effect and propose an improved model. Moreover, the improved model is exploited as a synapse model in spiking neural networks to recognize digit images. Simulation results show that the improved model overcomes the defects of the previous models and can be used as a synapse model in brain-inspired computing due to its synaptic characteristics. The results also indicate that the improved model can express the forgetting effect better when it is employed in spiking neural networks, which means that more appropriate evaluations can be obtained in applications.

1. Introduction

The memristor, called the fourth fundamental element besides resistor, capacitor, and inductor, was predicted by Prof. Chua theoretically in 1971.[1] It has received broad attention since being discovered by HP labs in 2008.[2] The memristor has broad prospects in many fields: it is a promising candidate for nonvolatile memory and may change the existing computer structure,[35] and it can be employed in image processing and other fields.[6,7] What is more, it is a promising candidate for synaptic devices in brain-inspired computing.[810] In this context, it is significant to build a good model for memristor application research.

HP labs presented a drift model when they discovered the first memristor.[2] However, HP’s model has some significant deficiencies; several window functions have been proposed to improve it. Joglekar et al. proposed a window function to overcome the terminal overflow; Biolek et al. proposed a window function to solve the terminal lock and so on.[1114] However, these window functions have their blemishes more or less. In this paper, a novel window function is proposed. It can overcome the terminal overflow and terminal lock and has good flexibility.

In later studies, researchers found the forgetting effect in several experiments, which is also called short term memory or volatile memory.[1517] In order to describe this phenomenon, many models have been proposed.[15,18,19] Berdan et al. and Ling Chen et al. proposed different drift diffusion models based on research results of Strukov.[2022] Though both models can describe the forgetting effect to some extent, they have some deficiencies that the forgetting effect in terminal (x = 0, 1) cannot be embodied in some situation that is common in fact. The forgetting effect has essential influences on accuracies and functions of applications, such as non-volatile memory and brain-inspired computing based on memristors. When a memristor model that cannot express the forgetting effect well is employed in simulations, it may cause serious errors. In this paper, we present an improved drift diffusion model to overcome shortcomings of the previous models. The threshold characteristic is added to our model since the existence of threshold has been discovered in several memristors. Spike-timing-dependent plasticity (STDP) of our model is verified, which means that the improved model can be employed as a synapse model. Moreover, the improved model is used in spiking neural networks (SNNs) to prove its merits. Simulation results indicate that the improved model can overcome deficiencies of the previous works and it can express the forgetting effect better when it is used as the synapse model in SNNs.

2. Window function
2.1. Existing window functions

The drift model proposed by HP labs is shown in Fig. 1. It can be seen that there are two regions in a memristor: the doped region and the undoped region, and there is a boundary between them. D is the length of the memristor and w is the length of the doped region.

Fig. 1. HP’s drift model.

The model is described as where Ron is the resistance when the memristor is completely doped, and Roff is the resistance when the memristor is undoped. x = w/D is the ratio of the doped region, v(t) is the voltage applied on the memristor, i(t) is the current flowing through the memristor, and μv is the mobility of oxygen vacancies. HP’s model can describe characteristics of memristors to some extent, but it has many defects. A significant one is that x can overflow the scale of [0, 1] when there is a continuous positive (negative) stimulus on the memristor, also called the terminal overflow.

In order to improve HP’s drift model, many window functions have been proposed. Jogleka’s window function is expressed as Then expression (3) is modified to This window function makes x not move when x = 0, 1, i.e., dx/dt = 0, so x cannot overflow the scale of [0, 1], as shown in Fig. 2(a). However, there is a problem that x can no longer move when x = 0, 1 even though there is a reversed stimulus, which is also called terminal lock. Biolek et al. proposed the following window function to solve the terminal lock: Biolek’s window function takes the current direction into account, as shown in Fig. 2(b). When i(t) > 0, expression (6) turns into f(x) = 1 − x2p, and f(0) = 1, f(1) = 0. When i(t) ≤ 0, expression (6) turns into f(x) = 1− (x − 1)2p, and f(0) = 0, f(1) = 1. Biolek’s window function solves the terminal overflow and terminal lock. However, the parameter p must have positive integers, which limits the flexibility of the window function. Then Prodomakis et al. proposed a window function which has good flexibility and overcomes the terminal overflow, but cannot solve the terminal lock. Other window functions are being proposed continuously. However, their advantages are accompanied by disadvantages.

Fig. 2. (color online) (a) Jogleka’s window function and (b) Biolek’s window function.
2.2. Novel window function

It is easy to know that function y = 1/x is a decreasing function in (0, + ∞). So it can be exploited as a basic model of the window function, and the model can be expressed as f(x) = 1/(ax + b) + c. When the stimulus is positive and x increases continuously, it is necessary to make f(0) = 1 and f(1) = 0. A set of solutions are a = b = 1/2 and c = −1, the function is f1(x) = 2/(x + 1) − 1. When the stimulus is negtive and x decreases continuously, it is easy to obtain f2(x) = 2/(2 − x) −1. Then considering f1(x) and f2(x) comprehensively, we introduce In order to increase the flexibility of expression (9), exponent p and coefficient j are added, p ≥ 0, j > 0. Then we obtain the final window function

As shown in Fig. 3(a), curves with different p have different trends, i.e., a broad range of adjustment. In Fig. 3(b), the novel window function is used in HP’s drift model. It can be seen that terminal overflow and terminal lock are both solved.

Fig. 3. (color online) (a) The proposed window function and (b) the simulation results in HP’s drift model, j = p = 1.

In short, our novel window function has obvious advantages compared to the previous ones. It uses two parameters and two variables to overcome all shortcomings, including the terminal overflow and terminal lock. The scale of p is [0, + ∞), which means great flexibility. And j can modify the scale of f(x).

3. Memristor model
3.1. Existing drift diffusion models

Researchers have found the forgetting effect in several memristor experiments. Some models have been proposed to describe this phenomenon. Berdan and Chen proposed different drift diffusion models based on HP’s drift model and the research results of Strukov.

Berdan’s model is expressed as Expression (14) is the same as HP’s model, expressions (12) and (13) are added. It is easy to find that y is a reference value for x. z is used to limit the change of y, i.e., y changes its value until z exceeds thresholds. −(xy)/Rx indicates the effects of mass diffusion and the nucleation process on the drift. By analyzing expression (11), it is easy to find that dx/dt < 0 when x = 1, i(t) ≫ 0 (also means I0(x) = 0), and y < 1. This means that x changes negatively though stimulus is still a great positive one. After a certain negative change (x < 1), x changes positively again. There is an oscillation for x, which is not coincident with the memristors. What is more, x no longer changes when x = y = 1 and no more stimulus is supplied. This means that the forgetting effect cannot be described. When x = 0, there are similar problems. The simulation results are shown in Fig. 4, it can be seen that both x and dx/dt have oscillation behaviors, and x can no longer change (or decay) when x = y = 1 and no more stimulus is supplied. In brief, Berdan’s model has defects when x = 0, 1.

Fig. 4. (color online) (a) Oscillation of x. x remains 1 when x = y = 1, v = 0. (b) Oscillation of dx/dt. Parameters are the same as those in Ref. [17].

Ling Chen’s is expressed as The −(xε)/τ indicates effects of Fick diffusion and Soret diffusion on the drift. The Fick diffusion has negtive effects and the Soret diffusion has positive effects. Expression (16) expresses the result of Soret diffusion, and expression (17) indicates the decay speed. By analyzing expression (15), it is easy to find that dx/dt is limited totally by the window function. As shown in Fig. 5, when x = 1, i(t) ≥ 0, and f(x) = 0, x is impossible to change (or decay), even though l − (xε)/τ < 0. When x = 0, there is a similar problem. In short, Chen’s model also has blemishes when x = 0, 1.

Fig. 5. (color online) x remains 1 when x = 1, i(t) ≥ 0, f(x) = 0. The parrameters are the same as those in Ref. [18].
3.2. Improved drift diffusion model

Obviously, both Berdan’s model and Chen’s models have deficiencies when x = 0, 1. In this section, an improved drift diffusion model is proposed to overcome problems on terminals (x = 0, 1). According to the discussion above, it is easy to find that the limitation of the window function should be decided by l−(x − ε)/τ (or l−(xy)/Rx) when x = 0, 1. The improved model can be expressed as where ω is the decay speed, which can be a constant or an expression. In this paper, ω is set as a constant by referencing to Ref. [18]. α and σ are parameters and ε is the final stable value of x. The main difference of the improved model is expression (19). When x = 1 and i(t)≥ q 0, if lω(x−ε) < 0, dx/dt is not limited by the window function. When x = 0 and i(t) ≤ 0, if lω(x-ε) > 0, dx/dt is not limited by the window function, either. In other cases, dx/dt is limited by the window function normally.

The threshold characteristic has been discovered in several memristors. In order to build a good model, taking the threshold into account is necessary. Expression (18) can turn into when l is limited by threshold function g(v). vt is the threshold voltage and vt > 0. If the absolute value of stimulus is greater than vt, x changes, otherwise x remains unchanged.

Simulation results of the improved model are shown in Figs. 6 and 7. The window function used in the model is the one proposed in Subssection 2.2. It can be seen that there is an obvious overlap in Figs. 6(a) and 6(c), which means that the improved model can describe the forgetting effects well. Changes of current, resistance, and x with time are shown in Figs. 6(b) and 6(d). In Figs. 7(a) and 7(b), a stimulus of 0.3 V for 50s and 0.01 V for 50 s is applied. When v = 0.3 V and x < 1, the stimulus is stronger than diffusion, dx/dt > 0 and x increases. When v = 0.3 V and x = 1, the stimulus is stronger than diffusion, but dx/dt = 0 and x remains unchanged. When v = 0.01 V, the stimulus is weaker than diffusion, dx/dt < 0 and x decreases. It is easy to find that x remains unchanged when |v| < vt, as shown in Fig. 7(c).

Fig. 6. (color online) Curves of (a) iv, (b) vt and it, (c) rv, (d) rt and xt of the improved model. Parameters are μv = 1 × 10−14 m/V2, D = 10 nm, Ron = 100 Ω, Roff = 16 k Ω, vt = 0.01 V, ε0 = x0 = 0.5, ω = 2, σ = 0.1, α = 0.01, j = p = 1.
Fig. 7. (color online) Curves of (a) dx/dtt and (b) xt of the improved model. Parameters are the same as those in Fig. 6 and a stimulus of 0.3 V for 50 s and 0.01 V for 50 s is applied. (c) xt curve, a sine stimulus is applied, vt = 0.5 V and the other parameters are the same as those in Fig. 6.

In brief, the improved model overcomes the defects of Berdan’s model and Ling Chen’s model and can describe the threshold characteristic. So this model can describe the characteristics of memristors better and be used.

3.3 STDP of the improved model

It has been found that memristors have synaptic characteristics and can be used as synapses, this is why memristors are used broadly in brain-inspired computing. Spike-timing-dependent plasticity (STDP) is a significant characteristic of synapses, and is usually used as an evaluation standard of artificial synapses. When a memristor model is used as a synapse model, it is important to have STDP.

As shown in Fig. 8, two spikes, pre-spike and post-spike, are supplied to the improved memristor model. There is a ΔT between achieving time of the two. Simulation results are shown in Fig. 9. It can be seen that the weight change (ΔW = ΔG/G0 = R0/R−1) of the improved model is dependent on ΔT. What is more, the curve coincides with the standard STDP curve of a biological synapse very well. It means that the improved model can simulate synaptic characteristics well and can be used as a synapse model in brain-inspired computing.

Fig. 8. (color online) (a) Pre-spike and (b) post-spike.
Fig. 9. (color online) Simulation results of STDP learning of the improved model, vt = 0.6 V and the other parameters are the same as those in Fig. 6.
4. Application in SNNs

Since memristors can be employed as synapses, composing SNNs with neurons for pattern recognition is an important application of memristors in brain-inspired computing. To demonstrate that the improved model can represent the forgetting effect better in application studies, SNNs based on the improved model, Berdan’s model, Chen’s model, and the model without forgetting effect respectively are employed to recognize digit images of “0”–“9”. As shown in Fig. 10(a), there are 15 input neurons (pre_0–pre_14) and 10 output neurons (post_0–post_9) in the SNNs. Additionally, there is a memristor between each pair of input neurons and output neurons. Digit images of “0”–“9” are composed by 15 blocks, which are depicted in Fig. 10(b). The neuron is a kind of integrate-and-fire, which integrates charges from pre-synapses and generates a voltage spike when the voltage corresponding to the integrated charges exceeds a threshold voltage.

Fig. 10. (a) SNNs based on memristors and neurons, and (b) digit images of “0”–“9”.

Before training, the states of all memristors are mid-resistance, which means that the models are set as x0 = 0.5. During the training process, the memristors corresponding to the black blocks are trained to LRS and the memristors corresponding to the white blocks are trained to HRS, which means that all memristors are in the terminal state. The states of the memristors after training are shown in Fig. 11(a). During the recognizing process, when a digit image is supplied as the input, the input neurons corresponding to the black blocks generate a square pulse and send it to the output neurons via memristors, the input neurons corresponding to the white blocks generate nothing. Then, the integrated voltage of post_0–post_9 changes with time, and recognition is finished when there is a post neuron exceeding the threshold voltage.

Fig. 11. (color online) (a) States of memristors with different models after training, (b) results of recognizing image “1” with states of memristors in (a), (c) states of memristors with the improved model after forgetting, and (d) results of recognizing image “1” with states of memristors in (c).

Recognizing image “1” from other images is shown as an example. As shown in Fig. 11(b), post_1 exceeds the threshold voltage first with the memristors after training. After 80 s, the states of the memristors described by the improved model are shown in Fig. 11(c), which indicates that the resistance of the memristors changes due to the forgetting effect. As shown in Fig. 11(d), the voltage changes more smoothly and the maximum voltage that post_1 can reach is lower than that in Fig. 11(b). However, when Berdan’s model, Chen’s model, and the model without forgetting effect are employed in the SNNs, the states of the memristors and the integrated voltages of the output neurons have no difference compared to those in Fig. 11(b), which means that the forgetting effect is not present at all. The recognition of other digit images gets similar results.

When the forgetting effect can influence memristors’ states, taking the forgetting effect into account is essential when the memristor model is employed as the synapse in the studies of brain-inspired computing. From the simulation results, we can find that the improved model can express the forgetting effect of memristors better than the other models. The results also indicate that there will be an inaccurate evaluation of recognition if the other models are used to simulate the forgetting effect, but a more appropriate evaluation can be obtained when the improved model is used. Therefore, the improved model can work better in the situations that the forgetting effect plays a significant role.

5. Conclusion

A novel window function is proposed and has good performance. Deficiencies on x = 0, 1 of the previous drift diffusion models are analyzed, and an improved drift diffusion model is presented. The improved model deals well with influences of diffusion on drift when x = 0, 1. The threshold characteristic is added to the improved model to make it more common and the STDP curve of the improved model coincides with the biological synapse well. Different models are used as synapses respectively in SNNs to recognize digit images to examine their performances in representing the forgetting effect. Simulation results show that the improved model can express the forgetting effect better than the previous models.

Reference
[1] Chua L O 1971 IEEE Tran. Circ. Th. 18 507
[2] Strukov D B Snider G S Stewart D R Williams R S 2008 Nature 453 80
[3] Zhou J Yang X J Wu J J Zhu X Fang X D Huang D 2013 Sci. China: Inf. Sci. 56 1
[4] Zhu X Tang Y H Wu C Q Wu J J Yi X 2014 Chin. Phys. 23 028501
[5] Su S Jian X C Wang F Han Y M Tian Y X Wang X Y Zhang H Z Zhang K L 2016 Chin. Phys. 25 107302
[6] Li Z J Zheng Y C 2013 Chin. Phys. 22 040502
[7] Huang D Wu J J Tang Y H 2014 Chin. Phys. 23 038404
[8] Jo S H Chang T Ebong I Bhadviya B B Mazumder P Lu W 2010 Nano Lett. 10 1297
[9] Covi E Brivio S Serb A Prodromakis T Fanciulli M Spiga S 2016 Front. Neurosci. 10 482
[10] Zhang Y Li Y Wang X P Friedman E G 2017 IEEE Tran. Elec. Dev. 64 1806
[11] Joglekar Y N Wolf S J 2009 Eur. J. Phys. 30 661
[12] Biolek Z Biolek D Biolkova V 2009 Radioengineering 18 210
[13] Prodromakis T Peh B P Papavassiliou C Toumazou C 2011 IEEE Tran. Elec. Dev. 58 3099
[14] Zha J X Huang H Liu Y J 2016 IEEE Tran. Circuits Syst. II: Exp. Briefs 63 423
[15] Chang T Jo S H Kim K H Sheridan P Gaba S Lu W 2011 Appl. Phys. 102 857
[16] Chang T Jo S H Lu W 2011 ACS Nano 5 7669
[17] Wang Z Q Xu H Y Li X H Yu H Liu Y C Zhu X J 2012 Adv. Funct. Mater. 22 2759
[18] Meng F Y Duan S K Wang L D Hu X F Dong Z K 2015 Acta Phys. Sin. 64 148501 in Chinese
[19] Shao N Zhang S B Shao S Y 2016 Acta Phys. Sin. 65 128503 in Chinese
[20] Strukov D B Alibart F Wiliams R S 2012 Appl. Phys. 107 509
[21] Berdan R Lim C Khiat A Papavassiliou C 2014 IEEE Elec. Dev. Lett. 35 135
[22] Chen L Li C D Huang T W Ahmad H G Chen Y R 2014 Phys. Lett. 378 2924